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Abstract — We consider a setting in which a single item of content (such as a song or a video clip) is disseminated in a population 
of mobile nodes by opportunistic copying when pairs of nodes come in radio contact. We propose and study models that capture 
the joint evolution of the population of nodes interested in the content (referred to as destinations), and the population of nodes 
that possess the content. The evolution of interest in the content is captured using an influence spread model and the content 
spread occurs via epidemic copying. Nodes not yet interested in the content are called relays; the influence spread process 
converts relays into destinations. We consider the decentralized setting, where interest in the content and the spread of the 
content evolve by pairwise interactions between the mobiles. We derive fluid limits for the joint evolution models and obtain 
optimal policies for copying to relay nodes in order to deliver content to a desired fraction of destinations. We prove that a time- 
threshold policy is optimal while copying to relays. We then provide insights into the effects of various system parameters on the 
co-evolution model through simulations. 

Index Terms — Opportunistic Networks, Influence spread, Content dissemination 
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1 Introduction 

Due to the ubiquity of cellular networks, there has 
been a proliferation of hand-held mobile devices. 
The idea of mobile opportunistic networking is to 
exploit the mobility of users carrying such devices 
to transfer messages directly to each other during 
chance meetings. This is enabled by low-power radio 
interfaces on these devices (such as Bluetooth), and 
provides the opportunity for creating a multi-hopping 
communication network completely bypassing the 
cellular infrastructure. Since such a scheme cannot 
meet delay guarantees, applications that utilize mo- 
bile opportunistic networking must be delay tolerant. 
On the other hand, such a peer-to-peer (P2P) content 
delivery is scalable Q, as the rate of service scales 
in proportion to the number of peers in the system 
with little additional cost to the system planner. Such 
a combination of an application and opportunistic 
transport is also called delay tolerant networking (DTN). 

In this paper, we consider such a networking 
paradigm, and study the problem of dissemination 
of an item of content among the population of mobile 
nodes. The content could be a video (news footage, 
sports highlights, movie teaser, etc.) or an audio file 
(a recent song, a popular ringtone, etc.) The individ- 
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uals carrying the mobile devices also interact socially 
(thus forming a social network) and can influence each 
other to become interested in the content. In such a 
situation, the system objective could be to facilitate 
the spread of the content to as many interested nodes 
as possible. In doing this, nodes that are not (yet) 
interested in the content could be used to relay the 
content, thus making the content more available. The 
overall objective would then be to spread the content 
among those who are interested, while limiting the 
number of copies to those nodes who are not yet 
interested in the content. 

Literature Survey: In this paper, we refer to nodes who 
are interested in the content as destinations and those 
who are not yet interested in the content as relays. In 
the delay tolerant networking context, there is prior 
literature (see |3| and |4| and the references therein) on 
the optimal opportunistic copying of content in order 
to optimize delivery delay and/or wasteful copying 
to relays. 

An interesting aspect, that is largely unexplored 
in prior literature, is the evolution of interest in the 
content. For recently "released" content (such as a 
new video clip), the fraction of nodes interested in 
it (destinations), will not be fixed, but will evolve 
based on the influence of nodes that are already 
destinations. Such influence could be mediated via a 
centralized server, which uses a low bit-rate channel 
to broadcast the current popularity of the content, or 
by interactions between the mobile nodes themselves. 
Thus, it may be necessary to deliver the content 
to destinations while keeping track of the demand 
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evolution. 

A recent work that explores this aspect (5|, uses 
epidemic models to characterize demand evolution 
and aims to obtain a hybrid of P2P and client-server 
architecture to efficiently meet the demand. While 
|5| begins by assuming the Bass model for interest 
evolution, in our work, we analyze the interest evo- 
lution in its own right, adopting the Linear Threshold 
(LT) model |6 | from the viral marketing literature and 
deriving its fluid limit. Also, while in |5] the peer-to- 
peer file dissemination occurs only among the nodes 
interested in the content, we consider the notion of 
relays (as yet uninterested in the content) aiding the 
spread, thus leading to a more general dynamics of 
the co-evolution model. 

Contributions: In this paper we consider a population 
of N mobile nodes, and model their pair-wise meet- 
ings by independent Poisson point processes. The 
item of content is provided to a subset of the initially 
interested set of nodes. The LT model is used to model 
the spread of interest between nodes. A controlled 
epidemic spread model is used for the opportunistic 
copying of the content between nodes. In this frame- 
work, we make the following contributions: 

1) We consider the homogeneous influence linear 
threshold model (HILT) model which is a special 
case of the LT model introduced in Kempe et al. 
|6J. The population processes in the HILT model 
are modeled as a Markov chain, and a fluid limit 
is developed under a certain scaling of the spread 
dynamics as TV — > oo. The fluid limit nicely 
captures the influence threshold distribution via 
its hazard function. The well known SIR epidemic 
model |7| emerges as a special case when the 
threshold distribution is exponential. It is this SIR 
model that is then used in the remainder of the 
paper for modeling the spread of interest in the 
content. 

2) For the case in which influence is spread by 
interaction between the devices, and a controlled 
epidemic spread of the content, we obtain the 
SIR-SI model, for which we obtain the fluid limit 
for a fixed probability, a, of copying to a relay 
node (i.e., an uninterested node). 

3) When the copying probability, a, can vary with 
time, we obtain a controlled o.d.e., for which we 
obtain the optimal control by direct arguments 
using certain monotonicity properties. This re- 
sults in a time-threshold structure of the optimal 
control. We provide an extensive numerical study 
of this model, thus providing additional interest- 
ing insights. 

Outline: In Section |2j we study the HILT model 
for evolution of interest, derive its fluid limit for 
general influence threshold distributions and discuss 
the effect of threshold distribution. In Section [3] we 
study the decentralized model (SIR-SI model) for co- 




Fig. 1 . The graph for the HILT influence model 

evolution of content popularity and availability and 
establish the optimality of a time-threshold policy, for 
copying to relays. Finally, in Section |4j we first solve 
certain optimization problems concerning the evolu- 
tion of interest. We then numerically compute optimal 
policies for the decentralized model, and study the 
effect of system parameters on the time threshold. 

2 Interest Evolution: The HILT Model 

In this section we introduce the homogeneous influ- 
ence linear threshold (HILT) model used to model 
the evolution of interest in the content. In the orig- 
inal Linear Threshold (LT) model |6], the individuals 
are modeled as nodes of a weighted directed graph 
Q = (TV, £), where £ C AfxAf. With each i, j € £, there 
is associated a weight Wij which gives a measure 
of influence of node i on node j, normalized such 
that the total weight into any node is at most 1, 
i.e., J^i w i,j — 1- The Homogeneous Influence Linear 
Threshold (HILT model) |8| is a special case of the LT 
model where the network graph is complete and all 
influence weights are equal. Hence, we have a mesh 
network on the population Af containing N = \J\f\ 
nodes with each edge carrying the same influence 
weight 7jv = and T < 1 (see Figure nj. 

In the context of our content spread problem, at any 
step of influence spread, the population is partitioned 
into destination nodes (i.e., the nodes that want the item 
of content), and relays (i.e., the nodes that, as yet, do 
not want the content). Given an initial set of destina- 
tions, A(0), we denote by A(k), k € {0, 1, 2, • • • } the 
process of the set of destinations. Each node j € Af 
independently chooses a random threshold 9j > 
from a given distribution F(-) at the beginning and 
stays at this value thereafter |6|. In the HILT model, 
the net influence of a set of destinations on any relay 
is -fN times the size of the destination set. Given the 
initial set of destinations and the thresholds sampled 
by all the relays, a relay gets converted to a destination 
when the total influence on it exceeds its threshold. 
In the distributed setting of a mobile opportunistic 
network, the influence can be exerted in one of two 
ways: 

• Centralized spread: The system broadcasts the 
number of destinations over a low bit rate chan- 
nel, thereby causing influence to be exerted on all 
the relays. 
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Fig. 2. Spread of Influence in the HILT Model 

• Distributed spread: Influence is exerted during 
the random meetings between destinations and 
relays. 

In this section we develop a fluid limit for the HILT 
influence spread process, which will then motivate 
the distributed influence spread model that we will 
use in the remainder of the paper for our study of 
joint spread of influence and content. In the process 
of obtaining this model, we also provide the general 
fluid limit for the HILT process, and insights into the 
effect of the threshold distribution. 

We assume the progressive case, i.e., conversion into 
destinations is an irreversible process, i.e., a relay j ^ 
A(k — 1) gets influenced in step k if, 

i N \A(k-i)\>e 3 (i) 

thereby converting to a destination, and then stays 
in that state from then on. The influence on a relay 
node can be viewed as building up cumulatively over 
time. The initial set -4(0) is viewed as being infectious 
and results in some of the relay nodes "tipping" over 
their thresholds and getting converted to destinations 
in the first period. A node j that remains a relay 
has the level of cumulative influence on it raised to 
7at|^4(0)|(< 9 3 ). The nodes in .4(0) are now viewed 
as being non-infectious, and the newly infected nodes 
are denoted by £>(1), with .4(1) = .4(0) + P(l); 
see Figure [2] Thus, at the end of each period the 
population will contain three types of nodes: the 
set of destinations A(k), the set of newly infected 
destinations T>(k) C A(k) and the set of relays, S(k). 
Evidently, the activation process stops at a random 
step T when there are no more infectious destinations, 
i.e., V(T) = 0, at this step the terminal set A(T) is 
reached. 

Need for fluid limit: From |6j it is known that the 
problem of identifying the most influential nodes in a 
social network under the Linear Threshold influence 
model is NP-hard. This is due to the difficulty in 
computing the expected influence (size of the terminal 
set) for a given seed set, which involves listing down 
all the path probabilities in the graph. In |8|, we have 
provided an analytical expression to characterize the 
expected influence of a set. Using this for the HILT- 
model we can provide, a closed form expression for 
the expected size of the terminal destination set A(T). 
But in order to exploit the interest evolution better, 



we would be interested in knowing the expected tra- 
jectory of the size of the destination set A(k), and we 
use fluid limits to characterize this expected evolution 
of the process. 

2.1 O.D.E. Model for Interest Evolution 

In this section we will use the convergence of a scaled 
discrete time Markov chain (DTMC) to a deterministic 
limit described by an ordinary differential equation 
(o.d.e.) ( see Kurtz [9], Darling pOl), to develop de- 
terministic (or, so called, fluid limit) approximations 
to the HILT model for interest evolution. 

Recall that A(k), k > 1, is the set of destinations and 
T>(k) the set of newly added destinations (infectious) at 
the end of period k with X>(0) = .4(0). Define B{k) = 
A{k - 1) with B(0) = 0. Thus, for k > 1, B(k) = 
Uo<i<fc-iIV We will work with sets B(k) and T>(k) 
to derive the fluid limits of the HILT process. Also, 
since the nodes are homogeneous in the HILT model, 
it suffices to record the sizes of the respective sets, 
not the exact membership of the sets themselves. Let 
A(k), D(k), and B(k) be the sizes of the sets A(k), 
V(k), and B(k), respectively. 

We can show that the original HILT process 
(B(k), D(k)) is a Markov chain (see Appendix |A| . 
In order to obtain an approximating o.d.e., we 
work with an appropriately scaled Markov process 
(B N (t),D N (t)), which evolves on a time scale N 
times faster than that of the original system. We can 
visualize this process as evolving over "minislots" of 
duration l/N, whereas the original process evolves at 
the epochs k — 0, 1, 2, • • • . The minislots are indexed 
by t — 0, 1, 2, * * - . Since this new process runs on 
a faster time scale, we scale the amount of change 
it undergoes in each "minislot" by adopting an ap- 
proach taken in (TT| . In the present context, the scaling 
approach can be interpreted as follows. In each minis- 
lot, each infectious destination in D N (t) is permitted 
to influence the relays with probability and its 
influence is deferred with probability 1 — ^. In the 
former case, it contributes its influence of 7^ and then 
moves to the set B (t + 1), otherwise it stays in the 
D N (t + l) set. In the original process, the influence of 
all the newly infected destinations will be taken into 
account when determining the popularity level of the 
content in the next time step, whereas, in the scaled 
process, only those infectious destinations that choose 
to use their influence will be considered^] Define by 
C N (t) C D N (t) the set of infectious destinations who 
use their influence at time t and thereby become non- 
infectious. Then, 

c N (t) = D ^ t ± + z»(t 



1) 



1. The scaling is done in order to obtain an analysis of the system 
as N scales. Since the number of pairs of infectious destinations and 
susceptible relays scales as N 2 , we need to further slow down the 
dynamics by a factor of N and hence the probabilistic scaling. 
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B N (t + l) 



= B N (t) 
= B N {t) 



C N (t) 
D N (t) 
N 



and by applying Equation [T] to the relay nodes j G 

M\A(t), 
D N {t + l) = D N (t)-C N (t) 

-F( lN (B N (t) + C N (t) j) - F( lN B N (t)Y 



E 



l-F(rf N B*r(t)) 
N - B N (t) - D N {t)] + Z$(t + 1) 



where Zj*(t 



1) and 



1) are zero mean 



random variables conditioned on the history of the 
process (B N (0), D N (0), ■ ■ ■ B N (k), D N (k)) represent- 
ing the noise terms, and the expectation in the ex- 
pression for D N (t +1) is with respect to C N (t). 
Define B N (t) = and similarly C N {t) and D N (t) 

as fractions of the total population. (B N (t), D N (i)), 
t = 0, 1,2, ••• is a DTMC on the state space 
[0,i,f,...l]x[0,i,f with B N (t)+D N (t) < 1. 

Then we can state the following theorem on the 
convergence of the scaled process to a deterministic 
limit. We assume that the threshold distribution F(-) 
has density /(•). 

Theorem 1 : Given the scaled HILT interest evolution 
Markov process (B N (t),D N (t)), with bounded first 
derivative of the density /(•) of the threshold distri- 
bution, we have for each T > and each e > 0, 

N ([Nu\),D N ([Nu\))-(b(u),d(u))\\>e 



sup I (B 1 

0<u<T 



N- 







where (b(u), d(u)) is the unique solution to the o.d.e., 

b = d 



f{Vb)Td 
1 - F{Tb) 



(1-b-d) 



with initial condition (6(0) = 0,d(0) = o(0)). 

Proof: This is essentially an instance of Kurtz's 
Theorem J9j. See Appendix [B] for the steps involved 
in deriving the fluid limit and a detailed proof verify- 
ing the necessary conditions for Kurtz's theorem for 
the HILT model. Since the o.d.e. drift equations are 
Lipschitz, they satisfy the Cauchy-Lipschitz condition 
and the system has a unique solution once the initial 
condition is fixed. □ 
Remark: The hazard function |12| for the cdf F(x) 



is given by hp{x) = i^p(x) wnere f( x ) is me cor " 
responding probability density function. Hence the 
o.d.e. becomes 

b = d (2) 

d = h F (Tb)Td(l - b - d) - d (3) 

Here hp(Tb)Td can be interpreted as the rate of con- 
version for the relays, where (1 — b—d) is the current 
fraction of relays in the population. 
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"0 2 4 6 t8 10 12 14 16 18 
Fig. 3. Convergence of the scaled HILT Markov 
chain (B N (k),i) N (k)) to the o.d.e. limit (b(t),d(t)). The 
o.d.e. solution (b(t),d(t)) plotted along with sample 
paths of the scaled process {B N (k),D N {k)) for N = 
50,100,500,1000. 

2.2 Accuracy of the O.D.E. Approximation 

Consider the HILT process of interest evolution, under 
the special case of uniform distribution of influence 
thresholds, i.e., 6^ ~ U[0, 1]. The hazard function 
corresponding to uniform distribution is given by, for 

< x < 1, 

h F (x) 

1 — x 

The corresponding o.d.e. s then become: 

b = d (4) 

d = (l-b-d)-d (5) 

Figure [3] illustrates the convergence of the scaled HILT 
process (B N ([Nt\), D N ([Nt\)) to the solution of the 
above o.d.e. (b(t),d(t)) for increasing values of N 
(50,100,500,1000). T and d Q were chosen to be 0.9 
and 0.2 respectively. We see that for TV = 1000 the 
o.d.e. approximates the scaled HILT Markov chain 
very well, as expected from Theorem [T] 

As we pointed out earlier, the scaled HILT process 
(B N (t), D N (t)) correctly captures the average dynam- 
ics of the original HILT process. Hence it can be 
expected that the o.d.e. solution evaluated at the ends 
of the original slots (indexed by k > 1), will be a 
good approximation to the average of the process 
{B{k),D{k)). This is illustrated in Figure |1J where 
multiple sample paths of the original discrete time 
HILT process, (B(k),D(k)), are superimposed on the 
o.d.e. solution, (b(t),d(t)). The good match confirms 
that the fluid limit obtained from the particular scal- 
ing of the Markov chain, retains the average be- 
haviour of the original influence spread process. 

2.3 Effect of the Threshold distribution 

In the HILT model, while T is indicative of the total 
level of influence each individual can receive from the 
others, the threshold distribution F(-) captures the 
variation among the individuals' susceptance levels 
for getting interested the content. An empirical analy- 
sis on the effect of threshold distributions on collective 
behavior was provided in ||13|. Having established 
the o.d.e. limit and studied its ability to approximate 
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Fig. 4. Multiple sample paths of the unsealed HILT 
process (B(k), D(k)) for N = 3000 is plotted along with 
the o.d.e. solution (b(t),d(t)) for r = 0.9 and d = 0.2 

the evolution of interest process, we can now exploit 
the o.d.e. limit to study the effect of the threshold 
distribution. 

2.3. 1 Uniform Distribution 

For the HILT model of interest evolution under the 
uniform threshold distribution, we can state the fol- 
lowing theorem. 

Theorem 2: Given the initial fraction of destinations 
do, in an HILT network with influence weight Y under 
the uniform threshold distribution, define r = 1 — Y + 
Tdo. Then, 

1) The fluid limit for the evolution of interest is 
given by, 

do d _ rt 



6(f) 
d{t) 



r r 

-rt 



d e~ 



(6) 
(7) 



2) The final fraction of destinations is given by 

boo = 1 p rf " , (8) 
1 - T + Td 

Proof: The first part of the theorem is obtained by 
explicitly solving the o.d.e.s for the HILT model under 
uniform distribution (Equations [4] and [5j for the initial 
conditions 6(0) = 0, d(0) = d . □ 
The linear threshold model under the uniform 
threshold distribution has been studied in discrete 
setting for general networks in |6|. Since in the HILT 
model all nodes are homogeneous, we will be inter- 
ested in the influence of a set of size k. Consider the 
HILT network with N nodes and influence weight 7 at. 
Let I 7N (k) be the expected size of the terminal set of 
destinations A(T), starting with Ao of size k as the 
initial set of destinations. By using results from |8J, 
we can show that, 

I 1N (k) = k[l + (N — k) lN [l + (N-k- 1) 1N [1 + ■■■ 



oNj N 

J™(fc) 



In the expression for I lN (k), noting limjv- 
r and do = jj, we can show that as N — > oc, x 
boo = —■ This reconfirms the fact that the fluid 
model is consistent with the discrete formulation. 
Also, while |8| allows us to compute only the final 
fraction of destinations, our current work provides a 
good approximation of the actual trajectories of the 
influence process. 
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Fig. 5. Final fraction of destinations as predicted by the 
fluid limit boo (bold lines) and by the discrete formulation 

1 (k) 

T y (dashed lines)plotted for various values of r and 
do and N = 3000. The gap seen between and 



r(fc) 



N 



decreases to zero as TV ->• 00. 



Remarks: Figure |5] shows the behaviour of boo for 
various of T and do. Plotted alongside are the values 



of ^ 



(fc) 



jy — for the corresponding values of 7at and k for 
N = 3000 and it can be seen that the two solutions 
match very well. It is easy to note that when T = 0, the 
HILT process does not have any effect on the number 
of destinations, i.e., the fraction of destinations and 
relays in the population remains a constant, as, for 
example, in |3|. We observe from the fluid limit that, 
as long as T < 1 we cannot influence the entire 
population (i.e., 600 < 1) unless we start off with the 
entire population to be destinations (i.e., do = !)• But 
if r — 1, then the influence of the destinations are 
at the maximum, and we can ultimately convert all 
nodes into destinations, given we start with a non- 
zero fraction of destinations, i.e., boo = 1 provided 
d > 0. 

2.3.2 Exponential Distribution 

For the uniform distribution the hazard rate function 
is hp(x) = j-zj: which is monotonically increasing 
in x. Such a hazard rate implies that the population 
consists of relay nodes that are more susceptible to 
getting interested in the content as the total num- 
ber of destinations increases (i.e., total accumulated 
influence over the past increases). The exponential 
distribution with parameter A yields hp(x) = A, a con- 
stant hazard rate function. This implies a memoryless 
property for the influence process, i.e., the relay nodes 
are equally likely to get influenced at a given time 
instant, irrespective of the net accumulated influence 
in the past. We can then state the following theorem. 

Theorem 3: Given the initial fraction of destinations 
do, in an HILT network with influence weight Y under 
exponential distribution with parameter A, 

1) The fluid limit of the evolution of interest is given 
by the solution to the o.d.e., 

b = d 

d = XYd(l-b-d)-d 

2) The final fraction of destinations is the solution 
to the transcendental equation, 

boo = 1 - (1 - do)e- A ™~ 
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Proof: The first part is obtained by substituting 
hp{x) = A in the HILT o.d.e. (Equations [2] and [3b. 
This is equivalent to the SIR epidemic model with 
infection rate Ar and recovery rate of 1 17). Note that 
b(t) is then equivalent to the Recovered set (R), and 
d(t) is equivalent to the Infected set (I). The second 
part follows from classic SIR literature by observing 
that the basic reproduction number (expected number 
of new infections from a single infection) is Ar. □ 
Remarks: We see that an HILT model with ex- 
ponential distribution with parameter A, and edge 
weights 7jv = is equivalent, in fluid limit, to 

an SIR model with infection rate Ar and recovery 
rate 1. Alternatively, if we consider the SIR model 
as modeling epidemic spread over a population of 
nodes, with pairwise meetings occuring at Poisson 
points of rate A, then in order to get an infection 
rate of Ar, we need to consider a r-thinned version 
of the Poisson process, i.e. when an infectious node 
meets a susceptible node, it passes the infection with 
probability T. This interpretation would be crucial 
while considering the decentralized version of the 
influence spread process. 

3 Joint Spread of Interest and Con- 
tent: SIR-SI Model 

In this section, we aim to model the joint spread 
of interest in the content and the content itself. We 
do this by combining the earlier analysis on interest 
spread (essentially the SIR model) with a controlled- 
epidemic copying process for content spread (proba- 
bilistic control, similar to the Susceptible-Infective (SI) 
model (14)). 

Let k £ N index the epochs of pairwise meetings 
or "recovery" of a node (from the SIR literature). 
Pertaining to the content, each node has a want state 
and a have state. In order to model the content delivery 
process, we further classify the nodes depending on 
whether they have the content (i.e., based on the 
have bit). Recall A(k), the set of destinations and let 
S(k) = J\f\A(k) be the set of relays. We further classify 
the destinations as either infectious (D (&)) or non- 
infectious (B(k)). Let X{k) C A(k) denote the set of 
destinations that have the content, and y(k) C S(k) 
denote the set of relays that have the content. Let 
Xb(k) and Xd(k) respectively be the intersection of 
X(k) with the sets B(k) and V(k). 

Pairs of nodes meet at the points of a Poisson 
process with rate A^v = ^, and a relay node gets 
converted into a destination, if it meets an infec- 
tious destination, and the infectious node succeeds in 
transmitting the influence (modeled by the influence 
weight r resulting from the HILT model). We also 
include the recovery rate /3/v, the rate at which in- 
fectious destinations become non-infectious. For the 
SIR model derived from the HILT model, (3^ = 1, 
but in this case we wish to work with a generic 



/8jv. For the evolution of (X(k),y(k)), we model a 
content copying process based on the Susceptible- 
Infective (SI model). Whenever a node that has the 
content meets a node that does not, content transfer 
takes place based on a controlled copying process. We 
always copy to a destination that does not have the 
content, whereas copying to a relay is controlled by 
a probability a £ [0,1]. We wish to obtain the fluid 
dynamics of this model. 

In the combined model, underlying the content 
delivery process (the SI model), the influence process 
(the SIR model) converts relay nodes into destinations. 
Thus, in our setup, the fraction of destinations is 
time-varying (unlike |3|). Also, the content spread is 
dependent on the interest evolution but not vice versa. 
An interesting feature of these co-evolution models 
is the importance of copying to a relay node. As a 
content provider, we might be interested in delivering 
only to the destinations (interested in the content). 
But, there are two advantages of copying to a relay. 
First, copy to a relay promotes the further spread of 
the content even to destinations; this is the aspect 
explored in a controlled Markov process setting in 
|3 |. Second, the relay we copy to now might later get 
influenced to become a destination. 

3.1 System Evolution 

The set of all possible transitions among the dif- 
ferent states of nodes is shown in Figure [6] Note 
that the process evolves at epochs t/. indexed by 
k which are either pairwise meetings (occuring at 
rate Ajv|A/"|(|7V| — 1)) or instances of recovery of an 
infectious destination (occuring at rate j3T>(k)). The 
system state is represented by the tuple Z(k) = 
(B(k),D(k),X b (k),X d (k),Y(k)), the sizes of the re- 
spective sets. The dependence on N is implied, and 
not explicitly indicated in the notation. Z(k) is a 
continuous time Markov chain; in Table [7] we show 
its transition structure. The type of the epoch is given 
by the membership of the node(s) which are involved 
in the pairwise meeting or recovery, and determines 
the state update Sk at time Also note that, when 
a node from X d (k) meets a node from S(k)\y(k), 
there is a possibility of both influence spread and 
content copy. In such cases, we assume that the at- 
tempt to influence spread precedes content copy. If 
the influence succeeds (occurs with probability T), the 
relay node becomes a destination and is immediately 
given the content. If the influence fails, then it is 
treated as a relay, and the content is copied with the 
probability a (control). Thus, the state updates depend 
on whether a potential influence succeeded (occurs 
with probability T) and whether a potential relay node 
received the content (occurs with probability a). The 
rate of various epochs and the corresponding state 
updates are listed in Table [7] The system state does 
not change, for any other pairwise interaction, and 
hence the corresponding S k = (0,0,0,0,0). 
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""** Influence spread (SIR Model) 
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Influence & Content spread 



Fig. 6. Possible transitions between the states in the 
SIR-SI model. Observe that, in addition to the transi- 
tions due to SIR model and SI model separately, there 
are instances where both can occur simultaneously. 
For instance, when an infectious destination node i e 
X d meets a relay node j e S - y, the relay might get 
converted to a destination and also receive the content, 
as indicated by the diagonal transition. 



Epoch type i 


Rate R k [i) 


State update S^(i) 


V - X d recovers 


lM,D(k)-X d (k)) 


(1,-1,0,0,0) 


X,! recovers 


0nX d {k) 


(1,-1,1,-1,0) 


B - X b meets X + y 


\ N (B(k) - X b (k))(X(k) + Y(k)) 


(0,0,1,0,0) 


V - X d meets y 


\ N (D(k)-X d (k))Y(k) 


(0,0,0,1,0) 

+ (0,1,0,1,-1) w.p. r 


V- Xd meets X 


\ N (D(k) - X d {k))X(k) 


(0,0,0,1,0) 


X,i meets y 


X N X d (k)Y{k) 


(o,i,o,i,-i) w.p r 


S-y meets X k + y 


\ N (X b (k)+Y(k))(S(k)-Y(k)) 


(0,0,0,0,1) w.p. a 


S-y meets V - X d 


\ N (D(k)-X d (k))(S(k)-Y(k)) 


(o,i,o,o,o) w.p. r 


S — y meets X d 


X N (X d (k))(S(k)-Y(k)) 


(o,i,o,i,o) w.p. r 
(o,o,o,o,i) w.p(i - r> 



Fig. 7. Transition Rates and state updates for various 
possible epochs 



3.2 Drift Equations 

The evolution of system state can be given by Z(k + 
1) = Z(k) +Sk, where Sk depends upon the type of the 
epoch. We can express the expected drift of the system 
by appropriately using the rates of various epochs. 
Define Z(k) = Z(k)/N and define the mean drift rate 
as follows: 

ie£ 

where i e £ indexes the epoch type, Rk(i) and 5k(i) 
as given in Table [7] See Appendix [C] for the exact 
expressions for the mean drift rates. 
Consider the o.d.e.s given below. 

b = fid ' (9) 

d = -f3d + XTds (10) 
Xb = (3x d + \(b - x b )(x + y) (11) 
Xd = F\(d - x d )y + TXx d s + (12) 
X(d - x d )(x + y) - px d 

ij = -T\dy + Xa(s-y)(x b + y+(l-r)x d X13) 

We can then state the following result. 
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Fig. 8. Convergence of the scaled coevolution process 
Z N (t) to the o.d.e. solutions z(t) for increasing values 
of N with (7 = 1 
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Fig. 9. Convergence of the scaled coevolution process 
Z N (t) to the o.d.e. solutions z{t) for increasing values 
of N with cr = 0.3 

Theorem 4: Given the coevolution Markov process 



we 



Z N {t) 

have for each T > and each e > 0, 

sup \\Z N ([Nu\) - z(u)\\ >e^ A 

0<n<T 



where z{u) = (b(u),d(u),Xi,(u),x d (u),y(u)) is the 
unique solution to the o.d.e. system given by Equa- 
tions with z(0) = (0, d(0), 0, Xd(0),0). 

Proof: The proof involves verifying the conditions 
for applying Kurtz's theorem to the SIR-SI process 
(see Appendix [C}. Since the drift equations are Lip- 
schitz, the uniqueness of the solution is guaranteed 
once the initial condition is fixed, by the Cauchy- 
Lipschitz condition. □ 



3.3 Accuracy of the O.D.E. Approximation 

Figures [8] and [9] illustrate the convergence of the scaled 
coevolution process Z N (t) to the o.d.e. solutions z(t) 
for increasing values of N for a = 1 and a = 0.3. 
We plot a(t) = b(t) + d(t), x(t) = x b {t) + x d {t) 
and y(t) for clarity. Observe that as N increases, the 
approximation of the original process by the o.d.e. 
becomes better. Observe that when a is increased from 
0.3 to 1, there is significant increase in y(t) but the 
contribution to x(t) is not significant. Also, for static 
control, x(t) asymptotically reaches a(t) irrespective 
of a and y(t) asymptotically reaches s(t) = 1 — a(t) 
provided a > 0. Note that from the o.d.e. equations, it 
is clear that, x(t) is monotonically increasing, whereas 
in general y(t) need not be monotonia 
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Fig. 10. Convergence of the scaled co-evolution pro- 
cess Z N (t) to the o.d.e. solutions z(t) for increasing 
values of N with dynamic a(t), a time-threshold func- 
tion in this case, <r(t) = 1, t < 4 and a(t) = for t > 4. 

3.4 Copying to Relays: Static vs. Dynamic Control 

In the above section we considered a to be a static 
control, i.e. a(t) = a, Vt. We could instead consider 
a dynamic copying control a(i) 6 [0,1]. Then for 
each JV we have a controlled continuous time Markov 
chain. The optimal control appears difficult to obtain 
in the finite size problem. We instead replace the 
constant copying probability in the ODE limit by ait) 
which yields a controlled ODE. We can then obtain 
the optimal (deterministic) control for the controlled 
ODE (as in |3.6) . In certain situations it can be shown 
that this control is asymptotically optimal for the finite 
size problem as N increases p5|. The proof given in 
1 15 1, for discrete time MDPs with finite time horizon, 
does not directly apply here, and needs to be extended 
to accommodate our setting. We do not prove this, 
and henceforth treat cr(t) as a heuristic, and obtain 
the optimal deterministic control. 

In order to justify this assumption, the convergence 
of the co-evolution process to its o.d.e. limit is numer- 
ically demonstrated in Figure [10] for a time-threshold 
type control. In this case, a(t) = 1, t < 4 and a(t) = 
for t > 4. Note that when a{t) = 0, y < from 
Equation {13) . Observe that the trajectory of x(t) is 
very similar to the one obtained when a = 0.3, but 
the value of y(t) for large t is considerably lesser. 
This indicates that this threshold policy (r = 4) will 
perform better than the static policy a = 0.3, in the 
context of the objective function to be posed in the 
next section. 

3.5 The Optimization Problem 

In 1 3 1, since the fraction of destinations was constant, 
it was suitable to choose the time of delivery to a 
fraction a of destinations as the objective to be mini- 
mized. In our setup, since the fraction of destinations 
is time-varying, we define the target time, 

T a = inf{t : x a (t) > aa(oo)} 

where a(oo) is the terminal fraction of destinations 
as given by the SIR model for interest evolution 
(b(t), d(t)) and x a (t) is the fraction of destinations that 



have the content at time t. Note that since d(oo) = 
and a(t) = b(t) + d(t), a(oo) = 6(oo). 

The cost we are interested in optimizing is of the 
form „ 



C a = ipy<j(T a ) + T a = ipyviTv) + 



Idt 



(14) 



Here y a (T a ) denotes the number of relays that have 
the content at time T a and signifies the number of 
wasted copies, and ip is the tradeoff parameter. 

Even though the copying cost is distributed across 
the nodes, we treat the number of wasted copies at 
the target time T a as part of the system objective. 
This can be motivated by considering an incentive 
mechanism for the relay nodes which still hold the 
content but are not converted into destinations by the 
target time T a . Consider an example, where the con- 
tent is a discount coupon for some service/product. 
The service provider, initially provides these discount 
coupons to a subset of its customers, who are then 
encouraged to spread replicas of the coupon. Nodes 
that are interested in the service, and are in possession 
of the coupon immediately claim the service. The 
service provider stops accepting discount coupons 
once a pre-defined number (here aa(oo)) of coupons 
have been used. At this time, the relay nodes that 
are still in possession of the coupon will need to be 
provided a reimbursement for helping the spread of 
the coupon. 

3.6 Optimal Control 

In this section we will establish the optimality of a 
time-threshold control for the objective given by the 
Equation |T4} . We will use definitions and lemmas 
provided in in order to prove the following theorem. 

Theorem 5: For the o.d.e system given by Equa- 
tions |9|-|[T3| there exists an optimal control of the 
form, 

n\ / 1, <t < t . . 

<Tr(*) = | 0> t > T (15) 



which optimizes the cost in Equation (14 1. 

Proof: Recall y a (T a ) is the amount of wasted 
copies, at the target time T a . When a(t) = cr T (t), a 
time-threshold policy (Equation (15|), we will denote 
this by y T (T T ), where r is the time threshold associ- 
ated with a T (t). 

The sketch of the proof is as follows. We first 
establish that, the set of values taken by y T (T T ) form 
an interval of [0,p max ]. Then, given any policy cr(t), 
we show that: 

• If y<j{Ta) — P < Pmax/ then 3 a time-threshold 
policy a T {t) such that y T {T T ) = p and T T <T a . 

• If y a {T a ) = p > p max , we can find a time- 
threshold policy <J T (t) which has a smaller total 
cost. 

Thus in either case, we have a time-threshold policy, 
which performs at least as good as the given policy, 
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Fig. 1 1 . Evaluation of optimal r for the time-threshold 
policy. The plot shows the behaviour of T T and y T (T T ) 
for various values of r, for fixed system parameters. 

which proves the optimality of the time-threshold 
policy. See Lemmas 5j [6] and [7] of Appendix [D] for 
proofs of the above claims. 

□ 



Fig. 12. Variation of versus T for various values of 
the required target /? with Y = 0.9. Notice that the value 
of p for each curve is the intercept of the curve with the 
vertical axis 

1r 



Figure 11 shows the variation of T T , y T (T T ) and 
C T as a function of r for fixed system parameters. 
It can be seen that as r is increased, T T decreases 
monotonically and we see an increase in the value of 
y T (T T ). The optimal threshold r* minimizes the total 
cost C r , by balancing the two component costs, taking 
the tradeoff parameter ip into account. 

4 Numerical results 

In Sections [2] and [3] we have studied Markov models 
for influence spread and for the co-evolution of inter- 
est in an item of content and the spread of the content 
in a mobile opportunistic network. The spread of 
content can be controlled by the probability of copy- 
ing the content to uninterested nodes. We established 
fluid limits for these Markov models and illustrated 
their accuracy via simulations. In this section, we pro- 
vide an extensive parametric numerical study of the 
fluid models, and report insights that can be obtained. 
While the first part of the section deals with optimiz- 
ing for the interest evolution process considered in 
isolation, the latter part deals with optimizations for 
the co-evolution process. We will study the interest 
evolution under the uniform threshold distribution, 
since the SIR model (which arises out of exponential 
threshold distribution) is well studied in the literature. 

4.1 Interest Evolution 

Content creators are often interested in understand- 
ing the evolution of popularity of their content, and 
would wish to maximize the level of popularity 
achieved. This, in our model, is equivalent to the final 
fraction of destinations (nodes that are interested in 
receiving the content). In most cases, the content cre- 
ator does not have control over the influence weight Y 
or the threshold distribution F(-) of the population. In 
order to increase the spread of interest in the content, 
the only parameter that can be controlled is do, the 
initial fraction of destinations in the population. In 




0.1 0.2 0.3 0.4(3 0.5 0.6 0.7 0.8 0.9 1 
Fig. 13. Variation of versus the required target /3 for 
various values of Y at T = 15 



Section \2.1\ we derived the relation between do and 
boo, the final fraction of destinations. We might be 
interested in choosing the right do which can give 
us the required b^, and we see that by rearranging 
Equation |8| we get, 

boo(i-r) 



do = 



b^Y 



(16) 



As discussed in Section 2.1 



the above equation is 
applicable for T < 1. When T = 1, from Equation (|8j 
we see that 6^ = 1 as long as we start with do > 0. 

Since the o.d.e. provides a good approximation for 
the temporal evolution of interest, we can also obtain 
results for the time taken by the process for the spread 
of influence. 

Theorem 6: Given the initial fraction of destinations 
do in an HILT network with parameter Y, the time we 
have to wait to get the final fraction of destinations 
to be at least /3 < ™) is given by, 



T(P,d ,Y) 



r 



1-r 



where r = 1 — Y + Ydo- 

Proof: Firstly, note that since = b^ 
we cannot reach j3 > Since we are observing the 
process at a finite time T, d(T) is not zero. Hence, we 
consider a{T) = b{T) + d(T) and set it to (3. We get, 

d (--(--l)e^ 
V r 



a(T) 



do 



(17) 



Rearranging terms,we get the expression for 
T(p,d ,T). ' □ 

Another interesting question would be to determine 
the initial fraction do to be chosen so that by time T 
we will have at least a fraction (3 of the nodes in the 
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destination set, in the HILT network with parameter 
r. This can be solved numerically using the following 
equation obtained from Equation (17) . 

1 - 



A. 



,-rT 



1 



Substituting for r 



i - r - rd Q , 



nr, 



-rd 



1 



ff(i-r) 

do 



r(i - do) 

Let H(d ) and G(d ) denote the LHS and RHS of 
the above equation. We know that d^ that satisfies 
the above equality will lie in [ i~gr ' ^] anc ^ ^ na ^ * ne 
solution is unique, since a(T) is a monotonic function 
in do. Also observe that H(do) is decreasing in do 
and G(d ) is increasing in do. Hence for do < dj, 
H(d ) < G(d Q ) and for d > d*, i?(d ) > G(d ). 
Under the above conditions, we can use the iterative 
bisection method that will converge to dj. 

The variation of d^ with respect to the parameters j3, 
r and T can be seen in Figures 12 and 13 As expected, 
for fixed T and T, a higher value of /3 requires a higher 
do (Figure [13} and if the network has a high T, then it 
is sufficient to start with a smaller do to reach a given 
/3 by T (Figure [13) . Observe that, as T is increased 
(the time constraint is relaxed), the required initial 
d decreases, from j3 for T — and asymptotically 
approaches d = ^rrg (obtained by setting — (3 



i-r/3 

in Equation (16j) (Figure |12j 



4.2 Decentralized Influence Spread 

Having observed that the optimal control is of the 



form 15 we can now numerically compute the opti- 



mal time threshold, which we shall refer to as r*. 
Recall the cost function is of the form, 

C a = T a + i\>y a {T a ) = ^y a {T a ) + [ ' Id/ 



where T a is the target time to reach a fraction a of 
the terminal fraction of destinations under the policy 
cr(t),i.e., 

T a = inf{t : x a {t) > aa x } 

Since in this case, a(t) = a T * (i), we shall denote the 
total cost by C T *, and the two components of the 
cost by T T * and y T * (T T *). In the following discussion 
we shall numerically study the effect of various cost 
parameters, system parameters and initial conditions 
on t*,C t +, T r * and y T *{T T ,). 

4.2. 1 Effect of initial conditions 



Figure 1 14 1 shows the effect of d on the optimal r*. 
We see that as we increase do keeping the fraction of 
constant, there are more destinations that have 



do 



the content. Further, there are fewer relays in the 
population, and with fixed T and f3, there is less 
chance for them to get converted into destinations. 
This prevents us copying to more relays and hence 
there is a decrease in the value of r*. Note that 



y .(T 



a = 0.95, f =4 
r=0.8,p = 1>0.7 
d(0)=d ,x d (0) = 0.5*d 




0.5 



0.2E 



i.1 0.2 0.3 0.4 0.5 ° 0.6 0.7 0.8 0.9 



Fig. 14. Decentralized Influence Spread: Effect of d 
on the optimal cost and optimal r 
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Fig. 15. Decentralized Influence Spread: Effect of ^j 1 
on the optimal cost and optimal r 



when d = 1, the entire population consists of only 
destinations, and hence the optimal r = 0. 

Figure (15} shows the effect of Xd on the optimal r*. 
As the fraction is increased, since a larger number 
of destinations have the content, the newly infected 
relays also obtain the content. Observe that -j 2 - = 1 
implies that all the initial destinations are given the 
content. This implies that any destination converted 
in the future, will automatically receive the content. 
This alleviates the need to copy to any of the relays, 
since the main purpose of copying to relays is to serve 
the destinations without the content (either because 
they were destinations not given the content initially 
or were converted by other destinations without the 
content at a later time). 

4.2.2 Effect of system parameters 



Figure 1 16 1 shows the effect of T on r*. We see that 
as r increases, it increases the rate at which relays 
get converted into destinations, without affecting the 
rate at which content copying occurs (dependent on 
the meeting rate A and copying probability cr(t)). 
We do not see much effect on r*, and T T *. This is 
because, though increasing V increases the terminal 
set of destinations (and thus the target aa(oo)), it also 
helps in achieving the target by the conversion of 
relays with the contents into destinations with the 
content. Also due to this conversion, we observe a 
decrease in y T (T T ), further indicated by the fact that 
y(t) < s(t) and the fraction of relays s(t) is decreasing 
rapidly 
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Figure |T7] shows the effect of (3 on the optimal r*. 
We observe that when f3 is very low, destinations re- 
main infectious for a longer duration before recovery. 
This leads to a faster decay in s(t) the size of relays 
and also an increase in the target chz(oo), and hence 
allowing /requiring a more aggressive copying policy 
(larger r). Also for very large (3, the total fraction of 
destinations carrying the content would be less, and 
hence to make the content more available, we need to 
copy to more relays. Also, since in this case, the rate 
of conversion to destinations is much lower, we note 
that a relay with the content has much lesser chance 
of getting converted into a destination. This explains 
the increase in y a (T,j). 

Figure ( |T8| shows the effect of A on the optimal 
t*. As A increases, the rate at which content spread 
increases, and thus a destination in need of the content 
is highly likely to obtain it from another destination. 
This results in a more passive policy of copying to 
relays, leading to a decrease in the optimal t* and 
the corresponding costs. 

4.2. 3 Effect of cost parameters 

Figure (19) shows the effect of a on r*. For the given 
system parameters, = 0.4352 and hence it is 
meaningful to start from a = 0.3. As a increases, we 
see that t* increases and in turn increases the costs, 
since we need to continue copying to relays for a 
longer duration. Figure |20| shows the effect of ip on 
t*. As ip increases, we see that the emphasis on the 
wasted copies, y(T), increases, and hence the control 



needs to be less aggressive. Thus we see a decrease in 
t* as ip increases, and this leads to a decrease in the 
wasted copies (y T *(T T *)) and an increase in the delay 
of reaching the target (T T *). We see an increase in the 
total cost C T * since both the terms in the cost function 
(T T * and ipy T * (TV*)) are increasing. 

5 Conclusions 

In this paper we studied the joint evolution of popu- 
larity and spread of content in a mobile opportunis- 
tic setting. We studied the evolution of popularity 
under a threshold based model (HILT model) and 
derived its fluid limits. We also showed that under 
the exponential distribution for the threshold, the 
fluid limit of the HILT model is a special case of 
the SIR model. We then develop a fluid limit for the 
combined spread of interest and the content(the SIR- 
SI model). Finally we showed that a time-threshold 
policy is an optimal copying policy for the SIR-SI 
model, to optimize the combined cost of target time 
and the amount of wasted copies. We also have 
reported several interesting insights into the evolution 
of popularity and the co-evolution of popularity and 
content delivery, which will help the content pro- 
ducers and distributors understand the interplay of 
various system parameters. 

Appendix A 

Proposition 1: For the HILT model, 
(B(k),D(k)),k > 0, is a discrete time Markov 
chain (DTMC). 
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Fig. 20. Decentralized Influence Spread: Effect of ij> on 
the optimal cost and optimal r 



which depends on the "history" only via (b, d), and 
thereby establishes the desired result. ■ 



Appendix B 

Fluid Limit for the HILT Model 

Recalling from Section |2j the evolution of 
(B N (t), D N (t)) can be written in terms of its 
mean "drift" at any t as follows. 

B N (t+l) = B N (t) ' ° N{t) 



z b N (t 



N 



D N (t + l) = 



N — 1 - 



D N {t) 



Proof: Since, for each k > 0, B(k + 1) = B(k) + 
D(k), it suffices to show that, for k > 0, 

P(D(k + 1) = e\(B(0), D(0)), ■ ■ • , ((B(k),D{k)) = (b, d))) 

is a function only of (6, d). We need the following 
simple lemma (in which the new notation is local to 
the lemma). 

Lemma 1: Given (Xi,X2,--- ,X n ) i.i.d. random 
variables with common c.d.f. G(x),x > a, where 
a > 0, and given b > a, define 

Z = {i : 1 < i < n,Xi > b}, and Z = \Z\. 



N 



E 



X il ,Y i =X ia ,-" ,Y z =X iz .Then 



P{Y l <y l ,l<t<Z\Z = m) 



nr 



G( V i) - G(b) 
1 - G(b) 



for yi > b, 1 < i < m. 

Proof: (of Lemma [TJ It is easily seen that, for yi > 

b,l < i < m, 

P{Z = m 1 Y 1 < Vl ,--- ,Y m <y m ) 
n 

1 (G(b)) n - m n? =1 {G{ yi ) - G{b)) 



and 



P{Z = m) = 



(G(b)) n - m (1 - G(b)) T 



from which the desired result immediately follows. ■ 
Returning to the proof of Proposition [TJ given 
(B(0), £>(())),••• 7 {{B{k),D{k)) = (b,d)), via a recur- 
sive application of Lemma [T] (starting from the ini- 
tial i.i.d. thresholds ®i, 1 < i < N, with common 
c.d.f. F{6)), we conclude that the thresholds of the 
to := N - (b + d) users in TV \ (B{k) U X>(fc)) are i.i.d. 
with common c.d.f. x~j^jp~ i over the range 9 > jj. 
At the end of period k, the newly influenced users in 
V{k) will serve as additional influence on the as yet 
uninfluenced users. Defining, 



1 - F(-yb) J 



Of the users in N \ (B(k) U V(k)), I will become 
interested (i.e., D(k + 1) = £) with probability 

' V ~ {b + d) ^ Pl (b, d)\l - Pl {b, d))^-^-* 



F{ lN N(B N {t) + C N {t))) - F{ lN N{B N {t)) 
l-F{ lN N{B N {t)) 
(1 - B N \t) - D N (t)) + Z% (t + 1) 

where we have defined ZF(t) and Z^ (t) in an 
analogous manner. The mean drift rate function, 
g N (B N (k), D N (k)) per unit update step size 
comes, 

g N (B N (t) 7 D N (t)) 



— be- 

n ue 



j_ 

N 



Index the indices in Z as ■ ■■ ,iz), and let Y\ = w here 



3f(.) = ^ v (*) 



N i 



NE 



F( lN N(B N (t) + C N (t)))-Fh N N(B N (t)) 



l-F( lN N(B N (t)) 
1 - B N (t) - D N (t)] — D N (t) 



Recall x (N — 1) = T and f(x) the density function 
of the threshold distribution F. We can then show that 
(see Appendix p), 

F( lN N(B N (t) + C N (t))) - F( lN N(B N (t)y 



lim NE 



l-F( lN N(B N {t)) 



f{Tb)Td ( , 
l-F(Tb) V 1 " 



Letting gx(b, d) — d and g%{b, d) 
and define, 

g(b,d) := ( gi (b,d),g2(b,d) 



f(Tb)Td 
1 - F(T6) 

-b-d)-d 



We can then use these limiting drift functions to 
obtain the fluid limits corresponding to the HILT 
model. 

Consider _ 

- F(-yN(B N (t) + - F{-yN(B N (t)y 

1 - F(~fN(B N (t)) 



limpf^^NE 



where the expectation is with respect to C N {t) 
given (B N (t), D N (t)). Applying Taylor's expansion to 

F(jN(B N {t) + C N {t))) around B N (t) we get, 
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lim 



N—yoa 



NE 



F(jN(B (t) + C N {t))) - F( 1 N(B N (t)) 



lirriN 
+lirriN 



1- F(~/N(B N (t)) 

1 NC N (t)f( 1 NB N (t)) 



NE 



1 - F{~/N(B N (t)) 

C N (t) 2 



,NE 



1 N- 



-/(C) 



1 - F{ 1 N(B N {t)) 



In the second term since C N (t) 
C N (t) ~ B±n(D N (t), i), we have 



c (t) 

' where 



E(C N {t) 



1 

N 2 
1 

N 2 
1 

iV2 



E{c n (ty) 

D N (t)-(i-h + ( DN( ^) 2 



1 



Also /(C) is bounded, since we require the drift 
functions to satisfy the Lipschitz condition. Hence the 
second term vanishes as N — > oo. In the first term, 
noting that -/N ->• T and E(C N (t)) = given 
B N (t) = b, D N (t) = d, the limit becomes ^(rl) - 

Kurtz's theorem [9| provides us a way by which 
we can, subject to certain conditions, approximate 
the evolution of a pure jump Markov process by the 
solution of a derived ODE. Along the lines of Theorem 
2.8 in pO|,to prove the theorem [T] we need to verify 
the following four equivalent conditions for the HILT 
model. Let 51(6, d) = d and g2(b, d) = prgrffjpj (1 — b — 
d) — d and define, 

g{b,d) := ( gi(b,d),g 2 {b,d) 



We require 

• Lipschitz property of g(b, d) 

• Uniform convergence of to g(b, d) 

N 

• Bounded Noise variance 

• Convergence of initial conditions 
(i) Lipschitz property We have, 

d9i =0 .9h =1 

db ' dd 



dg2 
db 



Y 2 dh F (Tb){\ -b-d)- Tdh F (Tb) 



dg 2 
dd 



hf(Tb)T{l-b-2d)-l 



Under the assumption that /(•) is bounded, we 
see that each of the terms above is bounded when 
(6, d) € [0, 1] x [0, 1 — b]. Thus the norm of Jacobian 
d)\\ is uniformly bounded, and it follows 
that g(b,d) — (gi(b,d),g2(b,d)) is Lipschitz. 
(ii) Uniform Convergence Let g N (b, d) be as defined 
in Appendix [B] By definition, lim7v->oo ^7 = T 
and by the steps in Appendix IB] the uniform 



convergence of 9 to g(b, d) in the domain 
(6, d) e [0, i, • • • , N] X [0, i, • • • , 6] is proven, 
(iii) Bounded Noise variance We will write the noise 
variance terms for the scaled processes B N (k) 
and D N (k), and from them derive the noise 
variance terms for B N (k) and D N (k) by scaling 
them by ^. Let Z^(k) and Z${k) be the zero 
mean random variables representing the noise in 
the drift terms of B N (k) and D N (k) respectively. 
We can then write, 

E{\Z b N (k)f\F k ] = (l-l)l^(fe) 



E[\Z»(k)\ 2 \F k ] 



V N N K ' 

z=D N {k) 



E 

z=0 



g{z){l-g{z)){N-B N {k)-D N {k)) 



D (k) 



N 



(D N (k)-z) 



where, 

9{z) 



F(jN(B (k) + z)) - F(/yNB (k)) 
1 - F(~fNB N {k)) 



Note that both these terms can be upper bounded 
by constants say and Cd, observing that (b, d) € 
[0, 1] x [0, 1—b] and hence the noise conditions for 
B N (k) and D N (k) are satisfied, 
(iv) Convergence of initial conditions By choice, we 
have B N (0) = 6(0) and D N (0) = d(0). 

The above four conditions validate the application of 
Kurtz's theorem and thus proves Theorem [T] 



Appendix C 

Verification of Kurtz's theorem condi- 
tions FOR SIR-SI MODEL 

The mean drift rates for the SIR-SI model can be given 

by, 

F b N (k) = /3 N D(k) 

F^(k) = -f3 N D(k)+N\ N r£)(k)S(k) 

F x N b (k) = f3 N X d (k) + 

NX N (B(k) - X b (k))(X(k) + Y(k)) 

F£{k) = TNX N (D(k)-X d (k))Y(k)~ 
[3 N X d (k) +TNX N X d {k)S(k) + 
NX N (D(k) - X d (k))(X(k) + Y(k)) 

F^(k) = -TNX N D(k)Y(k) + 

NX N a(S(k) - Y(k))(X b (k) + Y(k) + (1 - T)X d (k)) 

and the corresponding o.d.e. equations, 
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b 
d 

y 



0d 

-0d + XTds 

j3x d + X(b - x b )(x + y) 

TX(d- x d )y + TXx d s + 



(18) 
(19) 
(20) 
(21) 



X(d-x d )(x + y) - 0x d 
-TXdy + Xa(s - y)(x b + y + (1 - T)x d l22) 



Let fb,fd,fx„,fx d ,fy denote the right hand 
sides of the above fluid limit equations 

and let / := (fb, fd, fx h , fx d , fy)- Also let 
F N (k) := (F b N (k),F»(k),F£(k),FZ(k),F»(k)). 

(i) Lipschitz property We see that each of the partial 

derivatives ^ t where u, v G (b,d,x b ,x d ,y) is 
bounded when (b,d,Xb,x d ,y) € [0,1] x [0,1 — 
b] x [0, b] x [0, d] x [0, 1 — b — dj. Thus the norm 
of Jacobian is uniformly bounded, and it follows 
that f(z) is Lipschitz. 

(ii) Uniform Convergence Taking NXn — >• A and 
Pn — > /3 we see that the uniform convergence 
of F N (z) to f(z) is straightforward. 

(iii) Bounded Noise variance Since in the co- 
evolution system the maximum jump rate out of 
a state is bounded X N N(N — l)+/3 N N, and since 
NXn -> A and 0n — > 0, the jump rate is O(N); 
the increments for the scaled process Z (.) are 
0(N~ 1 ). This is referred to as "hydrodynamic 
scaling" |10|. This ensures that the noise conver- 
gence to zero is ensured. 

(iv) Convergence of initial conditions The 
initial conditions are chosen such that 
(B N (0),D N (0),X»(Q),X»(0),Y N (0)) 
(&(0),d(0),x 6 (0),x d (0),y(0)) 



Appendix D 

Let w(i) be the solution of the o.d.e w = /(w; z) with 
piecewise Lipschitz continuous control z(t), where / 
is continuously differentiable and Lipschitz in w and 
z. Let w^\t) and w^(t) be the trajectories corre- 
sponding to two controls z^'(t) and z^'(t) respec- 
tively i.e., 



,(2) = 



/(w«;z«) 



(23) 
(24) 



Motivated by the Kamke condition (see |T6)) we 
define Kamke-dominance between two controls z^ (i) 
and z^ (t) as follows. 



Definition 1: We say z ( >(t) Kamke-dominates z ( >(t) 
in the system f(w;z) if for each t where w^(t ) > 



w^>(t ) and 3i with w\ ' (t ) = wf'(t ) then 
/i(wW(t );* (1) (to)) > fi(wW(to);zW(t )) 



Lemma 2: Let wW(t) and w^ 2 \t) be as in Equa- 
tions <|23) and <|24j, and z (1) (t) Kamke-dominate z&(t) 
in the system /(w; z). If wW(0) > w< 2 )(0) then Vt, 

W «(t) > W< 2 )(<). 



Proof: The proof of Proposition 1.1 in |16) can 
be adopted directly in writing this proof. Since / is 
continuously differentiable and Lipschitz in w and z, 
by the Cauchy-Lipschitz condition, we have a unique 
solution given the initial conditions. Let ^^(w^^O)) 
denote the solution of p3| with initial condition 
wW(0), and <^ 2) (w( 2 )(0)) denote the solution of 1 24 
with initial condition w( 2 )(0). We wish to show that 
<f>t (w^fO)) > <j>t (w( 2 ^(0)). Consider m an integer, 
and let (fq '' m (w^(0)) be the solution corresponding 
to 

= /(w 



Then, by continuity of o.d.e. solutions with respect to 



drift and initial conditions |17 Chap.l, Lemma 3.1], 

0i 1),m (w( 1 )(O) + ^) defined on < s < t for all large 

m, say m > M and 

(D,™ (w (i) (o) + ^ ) _ >(/) (i) (w (i) ( O)) (25) 
m 

asm-) 00, uniformly in s E [0,i]. We claim that for 
< s < t, for all m > M, 

0« m (w«(O) + -) » 0i 2 V 2 >(O)) (26) 



where x » y implies x t > y it Vi. 

Proof of claim: Fix m > M. We know that 
w«(0) + ^ = </>P' m (wW(0) + A) » w( 2 )(0) = 
0o 2) ( w(2) (°))- B y continuity of </> (1) and (f> {2) , the claim 
d26l holds for small s. If the claim 1 26 1 were false, 



3 < < < t such that §26} holds for < s < t , and 
an index i such that, 



and 

d 

ds 



(i).- (w W (0) +^ ) ) < 
m 4 



ds 



(rf«(w W (0))), 



(27) 



But since Vy ^ i 



(^ 1)ro (w (1) (Q) + ^)),>(4 o 2) (w^(0))) 3 



and zW(t) Kamke-dominates z( 2 '(i) in the system 
/(w, z), we have 

M4 1),ro (w (1) (0) + -J);* (1) (M) 



1 

m 



/,(^ 2) (w( 2 )(0));z( 2 )(t )) 



>/ 1 (^ > ' m (w (1) (0) + -);, (1) (to)) 
m 



> 



which implies 

d (tfV(o)4)) 4 > d 

S — to 



ds 



ds 



(2) (w< 2 )(0))) i 



This contradicts ( |27[ and hence proves the claim p6[ |. 
Applying ( p5| to this claim completes the proof of the 
lemma. □ 
Consider the system of equations representing the 
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evolution of (xb(t), Xd(t),y(t)) in the SIR-SI process. 

x b = f3x d + X(b - x b )(x + y) (28) 

x'd = TX(d - x d )y + TXx d s + (29) 
X(d - x d )(x + y)~ fad 
y = -TXdy + Xa(s-y)(x b + y+(l-r)x d X30) 

Let w« = (x£\x$\yM) and = 
(x^ , x d 2 \yW) be the solutions respectively for 
the controls o^{t) and cr^ 2 \t) for the above system 
with CT (1) (t) < cr {2) (t). Define 

(u b ,u d ,v) := (4 1) ,4 1) , 2/ W)-(4 2) ,4 2) , 2/ ( 2 )) 

and Acr(i) = o-W(t) — fr^(tj. Then we can write, 

Ub\u b =o = /3u d + A(6-a;[ l 1) )(u (i + u) (31) 

u d k=o = rA(d-4 1 V + A(d-4 1) )K + «) (32) 
v\ v =o = X(s-yW)[AayW +aWu b + A*x { b 2 \33) 
+(1-T)(cj^u d + Aax d 2) )} 

Lemma 3: Control Domination 

(a) If Vt, crW(t) > cr (2) (i)^ then a^(t) Kamke- 
dominates cr' 2 )(t) in the system given by equa- 
tions |28}-|j30). 

(b) If Vt, yl 1 ^) > y (2) (^), then yW(t) Kamke- 
dominates y^ 2 \t) in the system given by equa- 
tions @-(^). 

Proof: 

(a) From equations (|3T|-([33|. Since Act > 0, we find 
that, 

• if Ud, v > and u b = 0, then iif, > 

• if Wh, w > and w<i = 0, then iid > 

• if > and v = v > 

This verifies the conditions of Definition [lj and 
thus proves (a). 

(b) From equations |3T|-|[32). Since v > 0, we find that, 

• if Ud > and u& = 0, then u b > 

• if Mh > and Ud = 0, then u d > 

This verifies the conditions of Definition [lj and 
thus proves (b). 

□ 

Lemma 4: (a) If Vt, a^(t) > a {2 \t), and te^O) > 
w (2 \0), then Vt, w^(t) > w^(t) 
(b) If Vt, yW(t) > yV>(t), and (^(O), x^(0)) > 
(4 2) (0),4 2) (0)),thenVt, 

(4 1) (t),4 1) (t))>(4 2) (t),4 2) (t)) 

Proo/' The proof follows by applying Lemma [2] to 
the systems in Lemma [3] (a) and (b). □ 

Define w(t) = (xb(t),x d (t),y(t)) We will now use 
the above lemmas to prove the claims in Section |3~6] 
in order to establish the optimality of a time-threshold 
policy. Recall that a has been fixed earlier. Recall the 
definitions of T a and y a {T a ). We shall replace them 
with T T and y T (T T ) whenever a(t) = ov(t), a time- 
threshold policy. 

Consider r such that r > T T —: r' . Evidently, T T > = 
r 1 , and y T (T T ) = y T ,(T T ,) t Vr > r'. 



Define T = {r : r > 0} and T = {r : r < T T } C T. 
Observe that {p : 3 t e T,y T {T T ) = p} = {p : 3 r e 
T, y T {T T ) = p}. Hence we limit our discussion to r e 

r. 

Lemma 5: The set {p : 3 r e T,y T (T T ) = p} forms 
an interval of the form [0, /9 roax ]. 

Proof: Setting r = 0, i.e., cr(t) = , Vt, we see 
from Equation ( |13| , since y(0) = 0, we have y(t) = 0, 
Vt. This establishes that p = belongs to the set. It is 
also easy to observe that p max is achieved by r such 
that r = T r . By the intermediate value theorem, it now 
suffices to show that y T (T T ) is a continuous function 
of T. 

Consider r,r + 6 G T, i.e., r < T T , r + S < T T+S . 
Observe from Lemma gj for <5 > 0, T T+S < T T . With 
the above constraints, evidently the only case to be 
considered is 

t < r + 5 < T T+5 < T T 

Let w T (.) and w T+ s(.) be the trajectories corre- 
sponding to cr r (t) and (T T+ s(t), with w r (0) = w T +a(0). 
Since a T (t) — <j T+ $(t), < t < r, it follows that 
w t (t) = w r+ a(r). Further, from Equations 1 11 H 13' 
we observe that, 

||w t (t + S) - w t+ s(t + <5)|| 2 < CiS, where 
C x = y/(p + A) 2 + (2rA + A + P) 2 + (A + TX) 2 

Vt > r + 5, CT T (t) = CT T+ 5(t) and hence by continuity of 
o.d.e. system w.r.t initial conditions at r + <5 implied 
by the Lipschitz nature of / w.r.t w (see |l8")), w T (.) 
is continuous w.r.t t. 

Recall that t < t + 5 < T T+5 < T T . By the just 
observed continuity, for e > 0, we can obtain 5 > 
such that, \x T+s (T T+s ) - x T (T T+s )\ < e , i.e., 

|aa(oo) - x T (r r+ a)| < e 

However over the interval (T T+ $,T T ], the rate of 
increase of x T (.) is bounded below by Aa(t) 2 (l — a)a 
(from Equations (TT|-|[T3|). Hence, 

\T T+ s-T T \< 



Xa(t) 2 (l — a)a 



which can be made small by choosing an appropriate 
S > 0. We have thus shown that T T continuous w.r.t 
r. r e T. 

To show the continuity of y T (T T ), consider, 
kr(7V) - y T+s {T T+s )\ < 

\Vt{T t ) - y T (T T +s)\ + \y T {T T+s ) - y r+ 5(T r+5 )| 

In the above equation, the first term on the right hand 
side can be made arbitrarily small by the continuity 
of T T w.r.t r and the fact that y T (.) is a continuous 
trajectory, and the second term can be made arbitrarily 
small by the continuity of w T (.) w.r.t r. Thus y T (T T ) is 
a continuous function of r, in the space of threshold 
policies. And since y T (T T ) : r — >• p and r e [0, oo), we 
see that p e [0,p max ]. □ 
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Lemma 6: Let a(t) be a policy such that y a (T a ) < 
Pmax- Then there exists a threshold policy cr T (t) whose 
cost is no worse than that of cr(t). 

Proof: Since p := y a (T a ) < p m ax, there exists r > 
such that y T (T T ) = p. We will argue that a T (t) is such 
that T T < T a . 

Let w r (t) and w CT (t) be the system trajectories for 
the controls a T (t) and er(i) respectively with w T (0) = 
w CT (0). By definition, for t < t, cr T (t) > a(t) and hence, 
by Lemma |ij w T (r) > w CT (r). 

Suppose, contrary to the claim, T T > T a . Evidently, 
this cannot happen if for aE t, < t < T a , y T (t) > 
y a {t)- This is because, by Lemma |2J we will have 
x T (t) > x a (t) and hence T T < T a . 

Hence, there exists t , r < i < T a , such that 
Vr{to) = y<r(to). Then we have y T {t ) = ya{h) > P 
(since y T {T T ) = P and r < t < T T , and for t > r, 
y T (t) < 0). Also, for t > t , a T (t) = < a(t). 
Thus from ([13}, Vs € {s : s > t ,y T (s) = y a (s)}, 
y T {s) < y a {s)- Thus for s > t Q , y T (s) < y a (s). Since 
Vt{T t ) = ya(T a ) = p, this implies T T <T a . 

□ 

Lemma 7: Consider a non-threshold policy a(t) 
which achieves p > p max - Consider the time-threshold 
policy a(t) of the form, 

&(t) _ ( 1, < t < sup{t : <r(t) > 0} 
\ 0, otherwise 

Then a(t) policy achieves a smaller total cost (given 
by Equation [14} than a(t). 

Proof: Let a(t) and o>(i)(t) be as above. Let 
Wf (t) and Tf be as defined earlier for time-threshold 
policies, corresponding to o>(t). Then by Lemma p] 
yt (Tf) < p max < P = y<j{T<j), and by Lemma |4j 
Tf < T a , and hence the o> if) policy achieves a smaller 
total cost than a(t). □ 
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